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In  1984,  lASSIST  begins  its  tenth  year  as  an  organi- 
zation.  While  preparing  this  issue,  I  looked  through  the 
earlier  publications.   In  one  sense,  we  are   still  dealing 
with  the  same  ideas:  how  to  establish  data  archives;  how  to 
manage,  store,  classify  and  retrieve  information;  how  to  create 
better  techniques  for  linking  archives  together  electronically; 
how  to  coordinate  our  efforts.   It  seems  likely  that  we  will  con- 
tinue to  have  these  concerns.   Jackie  McGee's  paper  on  the  indexing 
of  machine-readable  data  files,  originally  presented  at  the  lASSIST 
Conference  in  1983,  addresses  still  unresolved  questions  on  how  to  organ- 
ize and  locate  information. 


There  is  one  area  in  which  I  perceive  a  shift.   There  is  less  talk  nowadays 
of  the  "invisible  college."  The  image  of  the  archive  as  a  repository  of  valuable 
information  has  gained  a  footing.   An  archivist  in  an  academic  institution  is 
perceived  as  a  resource  whose  advice  and  assistance  will  form  an  integral  part  of 
a  research  project,  thesis  or  dissertation,  or  even  the  planning  of  courses. 
Researchers  rely  more  on  the  formal  organization  of  archives  and  less  on  private 
contacts.   Murray  Aborn's  paper,  also  originally  presented  at  the  I983  conference, 
proposes  a  further  shift  away  from  those  hoarded  collections  of  dusty  tapes,  and 
presents  possibilities  for  new  ways  to  manage  them. 

I  think  the  next  ten  years  will  be  productive  for  this  organization  if  we  do 
as  Dr.  Aborn  suggests,  and  turn  to  other  disciplines  which  may  benefit  our  work 
with  continuing  concerns  of  data  archive  management  as  well  as  expand  the  ways 
we  meet  future  challenges  to  support  social  science  research. 


Best  wishes 


Elizabeth  Stephenson,  Editor 
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In  the  1960's,  the  growing  influ- 
ence of  the  computer  caused  dramatic 
changes  to  take  place  in  the  concept 
of  scientific  data  and  the  character 
of  data  analysis.  Among  these  changes 
was  the  onset  of  a  shift  away  from 
single-purpose  data  collections  and 
analyses  based  on  relatively  small 
data  sets,  toward  large-scale  data 
collections  and  analyses  based  on 
data  banks  serving  multiple  applica- 
tions and  possessing  widely  accessible 
storage  and  retrieval  systems.  In 
social  science  this  led  to  the  esta- 
blishment of  data  archives  and  an 
early  attempt  to  regulate  their  func- 
tions. Additional  purposes  were  to 
keep  these  facilities  abreast  of  a 
rapidly  advancing  technology,  and 
enable  them  to  remain  au  oourant   with 
increasingly  sophisticated  management 
schemes  for  operating  over  larger  and 
larger  bodies  of  data  (1).  This  paper 
briefly  traces  the  role  of  the  National 
Science  Foundation  (NSF)  in  these 
developments,  discusses  the  current 
state  of  affairs  with  respect  to  social 
science  data  resources,  and  questions 
whether  continued  reliance  on  sheer 
data  amassment  is  the  true  path  to  the 
further  intellectual  progress  of  the 
field. 

THE  BUILDING  QF  NSF  DATA  PROGRAMS 

In  the  years  immediately  following 
the  advent  of  the  computer-based  data 


archive,  NSF  involvement  in  the  expan- 
sion and  upgrading  of  the  major  sources 
of  social  data  grew  and  intensified. 
To  increase  the  research-return  from 
the  enormous  investment  society  makes 
in  the  collection  of  social  statistics, 
projects  were  supported  to  enhance  the 
researcher  access  to  them.  To  fill 
the  gaps  in  the  social  science  data 
base,  projects  were  supported  to  main- 
tain data  series  not  covered  by  the 
federal  statistical  system  but  needed 
for  the  monitoring  of  social  and  eco- 
nomic trends  and  the  modeling  of  long- 
term  social  change.  Direct  support 
was  afforded  to  archival  facilities  to 
help  them  expand  their  holdings  and 
degray  the  costs  of  dissemination.  On 
the  user  side,  projects  were  supported 
to  increase  the  research  utilization 
of  stored  social  data  across  disci- 
plines, including  projects  to  intro- 
duce bibliographic-type  control  over 
machine-readable  data  files  in  order 
to  reduce  duplicate  data  collection 
and  help  prevent  incomplete  data  anal- 
yses. And  alongside  these  data  pro- 
grams ,  projects  were  funded  to  improve 
existing  methods  and  create  new  tools 
of  broad  utility  in  analyzing  the 
growing  stock  of  social  and  economic 
data  becoming  available  to  the  research 
community. 

In  1976,  a  committee  of  the  National 
Academy  of  Sciences  surveying  the 
social  sciences  at  NSF  acknowledged 


the  role  NSF  programs  were  playing  in 
the  sphere  of  data  resources.  It  de- 
clared: 

It  is  generally  felt,   and  refleoted 
in  the  long-range  plans  of  several 
of  the  social  science  programs  in 
NSF,    that  deficiencies  in  the 
available  base  of  social  science 
data  are  seriously  impeding  the 
progress  of  research   ( 2 ) . 

The  Committee  did  not  refute  this 
outlook;  in  fact,  it  ultimately  recom- 
mended that  such  planning  continue  and 
include  greater  support  for  longitud- 
inal studies  over  extended  time  periods 
and  national  facilities  for  survey 
research  and  large  data  bases. 

Financial  backing  for  the  data  pro- 
grams described  above  was  provided  not 
only  by  NSF's  Division, of  Social  and 
Economic  Science,  but  by  sections  in 
other  parts  of  the  Foundation,  such 
as  Computer  Science  and  Information 
Science.  Over  the  next  six  years, 
however,  Computer  Science  and  Infor- 
mation Science  turned  inward,  concen- 
trating on  their  own  disciplinary 
development  and  gradually  eschewing 
applicational  extensions  to  other 
fields  of  science.  But  in  social  sci- 
ence, the  work  went  on.  Programs  were 
maintained  that  to  this  day  continue 
to  build  a  data  resource  infrastruc- 
ture capable  of  sustaining  the  large 
empirical  research-  tradition  which 
characterizes  contemporary  social 
research. 

IS  IT  TIME  FOR  A  CHANGE  IN  ORIENTATION? 

The  National  Academy  of  Sciences 
committee  surveying  NSF's  social  sci- 
ence programs  in  1976  never  made  crys- 
tal clear  precisely  what  was  being 
referred  to  by  "deficiencies   in  the 
available  base  of  social  science  data," 
and  which  of  these  were  more  or  less 
responsible  for  "impeding  the  progress 
of  research."  It  is  pretty  apparent 
from  the  committee's  report,  however, 
that  data  resource  planning  in  social 
science  was  largely  oriented  toward 


filling  topical  gaps  and  producing 
lower  levels  of  aggregation,  larger- 
scale,  longer-term  data  gathering 
efforts,  and  a  more  systematic  approach. 
Though  the  importance  of  methodological 
accompaniments  to  assure  good  data 
quality  was  neglected  neither  in  NSF's 
programs  nor  the  committee's  report, 
the  effect  of  stressing  data  gaps  and 
data  shortfalls  inevitably  leads  to 
more  and  more  data  getting  collected 
and  more  and  more  data  being  retained-- 
and  that  is  exactly  what  has  happened. 

Now,  one  question  that  arises  at 
this  point  is  whether  an  orientation 
toward  data  amassment  has  had  negative 
as  well  as  positive  consequences.  The 
answer  is  "yes."  If  negative  conse- 
quences is  too  strong  a  term,  then  we 
can  at  least  speak  of  limiting  effects. 
And  if  there  have  been  negative  conse- 
quences or  limiting  effects,  then  it 
is  time  for  a  change  in  orientation. 
But  before  proceeding  to  describe  what 
the  required  change  appears  to  be,  it 
is  crucial  to  make  clear  that  such 
change  in  no  way  gainsays  the  compel- 
ling arguments  put  forth  in  many  recent 
publications  regarding  the  value  of 
secondary  analysis.  Nor  does  it  gain- 
say the  need  to  have  data  available  for 
reanalysis  in  order  to  test  for  bias  in 
reported  results,  challenge  data-driven 
theoretical  assertions,  and  generally 
carry  on  the  processes  of  scientific 
understanding  in  a  field  which  is 
rarely  able  to  conduct  controlled 
experiments  or  reproduce  the  original 
conditions  of  an  investigation.  A 
change  in  orientation  simply  argues  for 
diverting  some  amount  of  effort  and 
devoting  some  portion  of  available  re- 
sources to  study  the  deeper  aspects  of 
the  enterprise  in  which  the  field  has 
become  heavily  engaged. 

One  negative  consequence  of  the 
data  gathering  enterprise  has  been  the 
pejoration  of  the  term  "data."  This  is 
no  doubt  connected  with  the  fact  that 
the  enterprise  is  largely  concerned 
with  quantitative  data  in  computer- 
manipulable  form,  but  in  any  case  the 


term  data  is  now  commonly  used  inter- 
changeably with  the  terms  observations, 
information  and,  worst  of  all,  evidence. 
I  daresay  few  really  believe  that  data 
in  and  of  themselves  prove  anything, 
but  that's  the  way  we  have  come  to 
talk  and,  I  fear,  occasionally  think. 
However,  the  more  frequent  tendency  is 
to  confuse  data  first  with  observations 
and  then  with  information.  I  realize 
this  gets  pretty  elementary,  but  con- 
temporary social  science  data  archives 
contain  mostly  recorded  observations, 
not  data.  It  sounds  more  imposing  to 
speak  of  data   archives,  and  it  is  cer- 
tainly easier  to  raise  money  in  the 
name  of  data  than  it  is  for  just  plain 
old  observations,  but  the  terminology 
is  inaccurate.  Observations  become 
data  only  after  they  are  placed  in 
some  analytical  framework.  As  it  is 
obvious  from  the  general-purpose  nature 
of  the  data  archives,  the  same  obser- 
vations are  destined  to  be  interpreted 
as  more  than  one  kind  of  data. 

A  similar  confusion  prevails  with 
respect  to  data  and  information.  The 
two  are  not  synonomous ,  though  the 
exposition  here  is  yery   difficult  inas- 
much as  the  relation  is  inferential  and 
dependent  upon  the  application  of 
external  structures.  Simply  state,  it 
behooves  us  not  to  forget  that  any  body 
of  data  is  a  mixture  of  information  and 
noise,  and  that  the  proportions  will 
vary  according  to  the  use  to  which  the 
data  are  being  put. 

In  the  main,  the  signal  to  noise 
ratio  in  social  science  is  typically 
much  lower  than  in  the  physical  sci- 
ences, which  is  a  way  of  saying  that 
the  information  content  of  a  data  base 
can  be  very  meager,  particularly  when 
the  data  are  employed  to  test  hypothe- 
ses far  afield  from  the  hypotheses 
which  motivated  the  data  collection 
originally.   It  is  thus  ironic  that 
the  very  success  of  large-scale,  inte- 
grated data  bases  and  the  attendant 
data-processing  technology  often  leads 
to  a  confusion  of  the  technology  with 
the  natural  semantics  of  information. 


which  is  heavily  context-dependent. 
Thus  the  underlying  assumptions  appro- 
priate to  the  context  of  one  applica- 
tion may  be  totally  inappropriate  to 
the  contexts  of  other  applications. 
Moreover,  the  difficulty  is  compounded 
by  the  fact  that  in  their  research, 
social  scientists  are  heavily  depen- 
dent upon  data  files  which  were  not 
generated  for  scientific  purposes, 
such  as  census  data,  voting  records, 
police  and  court  records,  governmental 
budgets,  and  so  forth,  and  whose  infor- 
mational value  relative  to  the  kinds 
of  scientific  questions  social  scien- 
tists ask  may  be  completely  uncertain. 

BRINGING  INFORMATION  SCIENCE 

INTO  THE  PICTURE 

In  the  previous  section  of  this 
paper,  mention  was  made  of  the  ancil- 
lary role  played  by  the  computer  and 
information  sciences  in  the  building 
of  NSF  data  programs.  It  was  noted 
that  those  roles,  diminished  after 
1976,  and  that  NSF's  contributions  to 
the  data  resource  infrastructure  of 
present-day  social  science  has  been 
carried  on  exclusively  by  the  social 
science  elements  of  NSF.  This  situ- 
ation is  changing.  Given  recent 
advances  in  information  science,  it 
seems  particularly  important  to  begin 
to  apply  newly-formulated  principles 
of  knowledge  management  to  social 
science  data  resources  precisely 
because   their  holdings--observations 
of  social  and  behavioral  phenomena  in 
digital  form--tend  to  be  incomplete, 
imprecise,  and  error-prone  due  to  the 
fuzzy  nature  of  the  phenomena  being 
observed  and  the  looseness  of  the  data 
gathering  process.  Knowledge  manage- 
ment facilitates  the  translation  of 
user  needs  into  expressions  upon  which 
a  data  base  system  can  act.  One  exam- 
ple of  possible  applications  to  social 
data  is  the  development  of  data  base 
specification  languages,  that  is, 
languages  which  would  permit  social 
science  researchers  to  express  their 


requirements  in  functional  terms. 
These  might  then  be  translated  into 
a  database  format,  perhaps  based  on 
relational  structures  rather  than 
representational  ones,  as  is  the 
present  mode,  which  would  help  skirt 
the  data  dependence  problem. 

Other  areas  of  potential  applica- 
tion to  social  science  data  may  come 
from  information  science's  concern 
with  descriptive  classification,  in- 
dexing, and  the  problems  of  relating 
variant  terminology  in  a  single  re- 
trieval system.  The  current  work  of 
Dolby  is  an  example  (3).  Dolby  argues 
that  the  correctness  of  data  and  data 
analysis  involves  correctness  in  mean- 
ing, and  that  correctness  in  meaning 
goes  beyond  matters  of  computer  pro- 
gram correctness  or  the  numerical 
accuracy  of  data.  His  approach  con- 
centrates on  the  use  of  classification 
structures  to  extend  the  formal  treat- 
ment of  meaning  in  computer-based  data 
systems,  and  he  has  shown  how  such 
extensions  can  expose  or  reduce  ambi- 
guities and  inconsistencies  of  meaning 
in  such  systems. 

There  are  some  other,  more  practi- 
cal reasons  to  believe  that  the  time 
has  come  to  test  out  achievements  in 
information  science  as  they  may  be 
applied  to  stored  social  data.  Urgen- 
cies created  by  current  reductions  in 
the  quantity  of  social  science-usable 
data  generated  by  the  federal  statis- 
tical system  is  one  reason;  cutbacks 


in  the  funds  available  to  support 
scientifically  oriented  social  science 
data  resources  is  another.  It  would 
help  greatly  if  we  could  improve  our 
ability  to  estimate  the  degree  of 
redundancy  (i.e.,  the  amount  of  infor- 
mation overlap)  among  data  collections, 
and  if  we  could  make  progress  in  our 
ability  to  set  data  collection  and 
maintenance  priorities. 

Considering  the  potential  benefits 
of  bringing  information  science  into 
closer  contact  with  social  science  data 
problems  and  opportunities,  it  has  been 
decided  to  launch  an  initiative--still 
informal  at  this  juncture--to  make 
known  our  receptivity  to  proposals 
which  combine  or  merge  the  subject 
matters  normally  covered  by  social 
science  and  information  science  inde- 
pendently. Such  proposals  will  be 
handled  jointly  by  NSF's  Division  of 
Social  and  Economic  Science  and  its 
Division  of  Information  Science  and 
Technology  (4).  The  Division  of 
Social  and  Economic  Science  supports 
the  establishment,  evaluation,  and 
improvement  of  social  science  data 
resources,  research  on   social  data, 
and  the  development  of  methods  for 
analyzing  such  data.  The  Division  of 
Information  Science  and  Technology 
supports  research  to  increase  under- 
standing of  the  properties  of  infor- 
mation transfer.  We  believe  the 
future  will  show  that  this  initiative 
was  well  advised. 
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INTRODUCTION 

In  the  recent  past  much  has  been 
written  and  discussed  about  the  pro- 
blems of  cataloging  and  bibliographic 
control  of  social  science  data.  Many 
of  these  problems  may  have  been  re- 
solved with  the  implementation  of  the 
Angle-American  Cataloging  Rules  II, 
Chapter  9  (AACRII)  and  the  MARC  format 
for  bibliographic  control  (2).  How- 
ever, there  are  a  number  of  reasons 
these  solutions  may  not  yet  be  univer- 
sally implemented. 

For  instance,  the  AACRII  and  the 
MARC  format  may  be  very  familiar  to 
library  staff,  but  all  archives  are 
not  staffed  by  librarians.  Many 
archives  are  suffering  from  a  shortage 
of  staff  and  financial  resources. 
Federal  agencies  produce  a  major  por- 
tion of  the  data  archived  and  used  for 
secondary  analysis  and  these  agencies 
are  also  financially  depressed. 
Researchers  and  programmers  who  use 
these  Machine-Readable  Data  Files 
(MRDF)  are  not  as  aware  of  the  pro- 
blems related  to  the  acquisition  or 
storage  of  data  and  their  interests 
do  not  necessarily  correspond  to  the 
interests  of  the  data  archivist. 


Technological  changes  occur  so  fre- 
quently procedures  may  become  obsolete 
by  the  time  implementation  occurs.  And 
finally,  so  many  new  commercial  firms 
are  installing  social  science  numeric 
data  bases  online  and  the  interests  of 
these  firms  do  not  lie  in  the  same 
directions  as  those  of  the  data  archi- 
vist. To  assist  the  novice  who  may  be 
overwhelmed  by  some  of  these  problems, 
it  is  the  hope  of  the  author  this 
paper  will  provide  some  examples  of 
simple  record  keeping. 

Rowe  and  Byrum  previously  described 
a  user-oriented  system  for  the  docu- 
mentation and  control  of  MRDF  (3). 
This  system  was  comprised  of  four  parts. 
First,  a  standard  catalog  entry,  second, 
a  data  abstract  or  description  form, 
third,  documentation  codebooks  and 
lastly,  the  records  of  physical  and 
logical  characteristics  of  the  data 
set.  It  is  not  the  purpose  of  this 
paper  to  offer  an  alternative  system 
for  the  documentation  and  control  of 
MRDF  but  to  provide  a  practical  example 
of  implementing  such  a  system.  This 
example  will  provide  an  illustration 
for  the  person  who  has  just  received 


responsibility  for  the  safekeeping  of 
a  collection  of  MRDF  or  to  establish 
an  archive  and  isn't  sure  where  to 
begin.  The  first  item  in  the  system 
by  Rowe  and  Byrum,  the  standard  cata- 
log entry,  was  described  before  the 
Anglo-American  Catalog  Rules  II, 
Chapter  9  were  implemented.  Data 
librarians  located  in  a  traditional 
library  are  already  familiar  with  the 
rules  for  cataloging,  but  may  not  be 
familiar  with  the  AACRII,  Chapter  9. 

The  Anglo-American  Cataloging  Rules 
II,  Chapter  9  describes  the  standard 
rules  for  cataloging  MRDF.  It  is  not 
within  the  scope  of  this  paper  to 
argue  the  pros  and  cons  of  the  accept- 
ability of  the  AACRII.  There  can  be 
no  doubt  that  a  uniform  standard  de- 
fining MRDF  is  necessary  in  order  to 
alleviate  present  confusing  practices 
and  the  proliferation  of  titles  for 
one  data  file.  Certainly  implementa- 
tion of  the  AACRII  and  the  agreement 
of  the  MARC  format  were  giant  strides 
in  the  cataloging  and  bibliographic 
control  of  MRDF. 

STANDARD  CATALOG  ENTRY 

Rowe  and  Byrum  state  "Standard 
catalog  entries,  constitute  the  pri- 
mary records  by  which  computer- 
readable  data  files  should.be  con- 
trolled and  accessed."  It  is  with 
this  one  area  of  their  discussion  that 
I  disagree  slightly.  The  standard 
catalog  entry  requires  extensive 
staff  time  and  financial  resources 
and  need  only  be  considered  as  neces- 
sary under  certain  conditions;  if  the 
required  resources  are  available  and 
may  be  allocated  to  such  an  endeavor; 
if  the  data  archive  or  data  bank  is 
situated  in  a  library  or  a  library  is 
available  and  willing  to  participate; 
if  the  data  holdings  are  original 
data  from  the  institution  responsible 
for  the  establishment  of  the  archive. 

It  is  hoped  non-originating 
archived  data  will  be  catalogued  by 
the  originating  institution.  However, 
the  federal  government  is  responsible 


for  a  major  portion  of  the  data  files 
held  in  many  archives  and  current  fis- 
cal restraints  on  most  federal  agencies 
probably  will  not  permit  such  a  project 
in  the  near  future.  There  is,  however, 
an  ongoing  cataloging  project  at 
Michigan's  Inter-University  Consortium 
for  Political  and  Social  Research 
(ICPSB)  which  may  resolve  the  problem 
of  cataloging  federal  data  (4).  ICPSR 
is  certainly  one  of  the  largest,  if 
not  the  largest,  of  the  data  archives 
in  the  United  States.  When  this  pro- 
ject to  catalog  their  holdings  is 
complete,  it  may  be  possible  to  consi- 
der a  union  catalog. 

DATA  ABSTRACT  OR  DATA  DESCRIPTION  FORM 

It  is  the  Data  Abstract  or  Data 
Description  Form  described  by  Rowe 
and  Byrum  which  should  be  given  prior- 
ity in  the  development  of  an  archive 
record  system.  The  data  abstract  or 
data  description  form  in  a  standard 
format  is  an  absolute  necessity  and 
should  be  the  core  of  the  documentation 
for  the  archiving  of  MRDF. 

Aldrich  has  proposed  a  similar 
Abstract  Form  for  the  documentation 
of  federal  MRDF  (5).  It  was  from  a 
description  by  Aldrich  the  following 
example  was  derived.  Changes  made  in 
the  form  were  for  the  benefit  of  the 
user  and  do  not  reflect  a  disagreement 
with  her  proposed  standards.  Since 
this  form  includes  an  abstract  summar- 
izing the  data  set  or  file  being 
archived,  this  document  shall  be 
referred  to  here  as  a  Data  Base  Pro- 
file. With  some  slight  variations, 
the  items  contained  in  the  form  gener- 
ally should  contain  the  items  shown 
on  the  next  page. 

If  it  is  not  possible  or  feasible 
for  the  archive  or  library  to  catalog 
the  holdings  of  the  archive  according 
to  AACRII  at  least  the  information 
supplied  in  the  Abstract  or  Data  Base 
Profile  form  will  conform  to  the  stan- 
dards for  describing  MRDF.  If  at 
some  future  time  cataloging  is  possi- 
ble, the  information  for  the  catalog 


Abstract  Form 

for  the  Docwnentation  of  Federal  MEDF 

FILE  #: 

an  identifying  number  for  the  individual  archive 

FILE  NAME: 

a  title 

FILE  SOURCE: 

producer,  distributor,  processor 

PRINCIPLE  INVESTIGATOR: 

primary  researcher 

TYPE  OF  FILE: 

survey  data,  microdata,  administrative  records, 
process  records,  geographic  records,  software 

UNIVERSE: 

total  universe  the  records  describe 

SAMPLE  SIZE: 

number  of  observations  or  records 

SAMPLE  UNIT: 

household,  person  or  unit  being  measured 

RESTRICTIONS: 

none,  or  any  restrictions  placed  on  the  distribution 

ABSTRACT: 

a  summary  description  of  the  data  set  or  file. 
Each  abstract  held  in  the  archive  should  contain 
the  same  information.  Care  should  be  taken  not  to 
omit  any  portion  of  the  required  information  and 
the  information  should  appear  as  closely  as  possible 
in  the  same  paragraph.  This  assures  an  easier 
search  for  the  individual . looking  for  specific 
information  as  to  size  of  the  sample,  purpose  of 
the  study,  key  variables,  etc. 

REFERENCES: 

a  descriptive  listing  of  the  hard-copy  documentation 
available  for  use  with  the  data  file,  i.e.,  code- 
books,  survey  instruments,  dictionaries,  etc. 

RELATED  PRINTED 

REPORTS: 

known  reports  where  the  data  is  described  or  where 
the  data  file  has  been  used 

TAPE  SPECIFICATIONS: 

the  physical  characteristics  of  the  data  file,  the 
tape  numbers,  the  logical  record  length,  the  block- 
size,  data  set  names  and  density 

entry  will  be  readily  available. 

Copies  of  the  Data  Base  Profile 
may  be  stored  in  computer  format  as 
well  as  in  hard-copy.   If  the  data  is 
stored  in  computer  format  it  would  be 
possible  to  devise  a  simple  online 
search  capability.  If  the  data  libra- 
rian wishes  to  be  bibliographically 
correct,  study  the  AACRII  and  include 
in  the  Profile  the  pertinent  informa- 
tion from  the  AACRII  as  well  as  the 
information  required  or  deemed  neces- 
sary for  the  institution  housing  the 
data  library  (7).  Many  of  the  ele- 
ments of  the  Data  Base  Profile  may  be 
utilized  to  produce  a  catalog.  For 
instance,  each  abstract  when  extracted 
from  the  Profile  provides  summaries 
of  the  archive  holdings.  The  Rand 
Data  Facility  catalog  uses  the 
abstracts  in  such  a  way. 

Each  abstract  then  written,  there- 
fore, should  include  the  following 
(6): 

•  Data  base  identification  number 

•  Source 

•  Name 

•  Date  the  information  was  collected 

•  Subject 

•  Geographic  level  of  the  data 

(lowest) 

•  Population  or  sampling  unit 

•  Number  of  observations  or  number 

of  logical  records 

•  Key  variables 

Indicies  may  then  be  derived  from  the 
information  given  on  a  Data  Base  Pro- 
f  i  Te . 

SOURCE  AND  NAME  INDEX 

The  Source  and  Name  Index  is  de- 
rived from  the  File  Name  and  the  File 
Source  as  given  in  the  Data  Base  Pro- 
file. Often  these  items  are  sorted 
as  separate  indices;  an  author  index 
and  a  title  index. 

An  index  should  lead  a  user  to  the 
information  he  is  seeking  with  as 


little  effort  as  possible,  and  so  we 
have  combined  these  indices. 

On  the  Data  Base  Profile  and  in 
data  file  records  we  use  the  most 
correct  name  for  a  data  file.  The 
correct  name  may  be  drived  by  using 
the  AACRII  rules.  Since  we  also  wish 
our  individual  indices  to  assist  the 
user  in  his  search  we  also  include  in 
our  local  archive  index  those  aliases 
or  acronyms  when  they  are  commonly 
used,  if  the  index  is  to  prove  useful. 

In  order  not  to  have  a  great  many 

"see ."  included  in  the  index 

where  aliases  or  acronyms  or  common 
usage  names  are  listed,  the  correct 
identifying  number  of  a  particular 
data  file  is  used  as  the  pointer. 

KEYWORD  INDEX 

This  index  is  certainly  one  of  the 
most  difficult  to  construct.  A  the- 
saurus would  be  helpful;  however,  the 
keyword  index  discussed  here  was  deve- 
loped from  individual  data  files.  As 
mentioned  earlier,  one  of  the  mandi- 
tory  sections  of  the  Data  Base  Profile 
is  a  list  of  key  variables.  At  the 
time  of  archiving  a  new  data  file,  the 
abstract  is  written  and  the  key  vari- 
ables extracted.  These  variables  or 
subject  categories  are  added  to  the  end 
of  the  keyword  index.  A  copy  of  the 
keyword  index  is  kept  online.  New  key- 
words are  added  at  the  end  of  the  old 
index  and  a  short  SAS  program  sorts  the 
keyword  index  by  words  or  by  identify- 
ing file  numbers  whenever  necessary. 

GEOGRAPHIC  LEVELS  AND  MAJOR  SUBJECT 
VARIABLES  IN  THE  KEYWORD  INDEX 

Using  Census  Bureau  designations, 
the  lowest  geographic  level  of  the 
data  is  assigned  as  a  second  element 
of  the  keyword  index.  By  using  the 
lowest  geographic  level  it  is  possible 
when  searching  for  data  to  weed  out 
those  data  files  not  useful  to  the 
researcher. 

A  third  element  for  the  keyword 
index  is  a  prescribed  list  of  major 
subject  variables.  For  each  data  file 
the  keyword  index  will  include  at  least 


one,  but  not  more  than  three  major 
subject  categories.  It  is  then  possi- 
ble to  produce  tables  listing  data 
files  by  geographic  areas  and  major 
subject  categories.  Librarians  may 
want  to  use  the  Library  of  Congress 
Subject  Headings  (LCSH)  List. 

The  Data  Base  Profiles  may  be 
produced  as  printed  copies  to  be 
given  to  researchers  interested  in 
using  a  particular  data  file  and  can 
be  used  as  documentation  for  bibli- 
ographic citations  in  research  papers 
for  the  creating  of  a  catalog  of 
holdings. 

The  Data  Base  Profiles  stored  in 
a  partitioned  data  set  at  Rand  are 
soon  to  be  converted  to  a  total  on- 
line system  using  the  IBM  INFO/SYS 
CSD  data  retrieval  system. 

The  OZ  INFO/SYS  has  the  capability 
of  handling  multiple  data  bases  and 


has  been  utilized  at  Rand  recently  to 
create  an  information  system  for  a 
library  of  software  models  as  used  in 
one  department  (8). 

With  the  data  stored  in  OZ  it  will 
be  possible  to  search  the  system  for 
a  particular  data  base  by  title,  by 
keword,  or  keyword  combinations.  Re- 
questing data  bases  by  keywords  pro- 
duces a  "hit  list"  of  all  data  bases 
that  contain  the  requested  keywords. 
The  hit  list  can  then  be  accessed 
in  either  of  two  forms:  one  is  an 
abreviated  form  that  lists  only  a 
few  specifics  about  the  data  bases 
on  the  list  (including  the  data  base 
number  and  title);  the  other  is  a 
full  description  of  the  data  base. 
It  is  possible  to  get  print  copies 
of  an  OZ  screen,  of  the  individual 
data  base  descriptions  and  of  the 
hit  list  of  keywords. 
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NEW  DIRECTOR 
APPOINTED-SSRC 

Dr.  Howard  Newby  has  been  appointed 
Director  of  the  SSRC  Data  Archive.  He 
will  take  up  the  post  in  October  of 
this  year,  and  will  hold  a  Professor- 
ship in  the  Department  of  Sociology 
at  the  University  of  Essex. 

Dr.  Newby 's  main  interests  have 
been  in  the  fields  of  rural  sociology, 
social  stratification  and  comnunity 
studies.  Me  has  published  extensively 
in  each  of  these  fields,  including 
The  Deferential  Worker   (Penguin,  1979), 
Property,    Paternalism  and  Power 
(Hutchinson,  1978),  and  Green  and 
Pleasant  Land?   (Penguin,  1980).  He 
is  also  co-editor  of  Doing  Sociological 
Research   (Allen  and  Unwin,  1977). 

In  the  past.  Dr.  Newby  has  under- 
taken research  for  the  Countryside 
Commission  and  is  currently  working 
on  a  study  of  countryside  access  for 
the  Countryside  Commission  and  the 
Sports  Council.  He  is  also  co- 
principal  investigator  of  a  large 
scale  SSRC-financed  study  of  "Economic 
Stratification  and  the  Social  Struc- 
ture. " 

Dr.  Newby  has  held  visiting  appoint- 
ments at  the  University  of  New  South 
Wales  and  the  University  of  Sydney. 
He  was  formerly  Professor  of  Sociology 
and  Rural  Sociology  at  the  University 
of  Madison-Wisconsin  in  the  USA. 

Reprinted  from:   SSRC  DATA  ARCHIVE 
BULLETIN,   No.    26,   Maroia  Taylor,   ed. 


PRESS  RELEASE 


STEIN  ROKKAN  PRIZE  AWARDED 

The  1983  Stein  Rokkan  Prize  in 
Comparative  Social  Science  Research, 
instituted  by  the  International 
•Social  Science  Council  and  the  Con- 
junto  Universitario  Candido  Mendes, 
has  been  awarded  by  the  Executive 
Committee  to  Professor  Jens  Alber  of 
the  Research  Institute  of  Sociology, 
University  of  Cologne,  for  his  book 
Von  Armenhaus  zim  Wohlfahrtsstaat : 
Analysen  zur  Entwicklung  der  Sozial- 
versicherung  in  Westeuropa    (From 
Poorhouse  to  Welfare  State:  Analyses 
of  the  Development  of  Social  Insurance 
in  Western  Europe)  and  his  article: 
Einige  Grundlagen  und  Begleiter- 
scheinungen  der  Entwioklung  der 
Sozialausgahen  in  Westeuropa,    1949- 
1977   (Some  Causes  and  Consequences 
of  Social  Security  Expenditure  Dev- 
elopment in  Western  Europe,  1949- 
1977). 

The  jury,  chaired  by  Professor 
Rudolf  Wildenmann,  upon  whose  recom- 
mendation the  Prize  was  awarded,  con- 
sidered that  "Alber' s  book  is  an 
excellent  historically  based  study  of 
the  development  of  social  security  in 
Western  Europe.  Historical  develop- 
ments are  skillfully  linked  with 
social  science  theories  and  explana- 
tions, which  are  then  tested  against 
empirical  data  and  analysis--in  terms 
of  the  relative  importance  of  insti- 
tutions, and  of  the  value  of  socio- 
economic versus  political  influences." 
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The  1984  Conference  is  to  be  held  at  the  Park  Lane  Hotel  in  downtown 
Ottawa  from  Monday,  May  14  to  Friday,  May  18.  The  theme  of  the  con- 
ference will  be  "Coming  of  Age  in  the  Brave  New  World."  Sub  themes 
will  address  a  variety  of  topics  of  interest  to  social  scientists, 
data  archivists,  librarians,  research  administrators,  computer 
specialists,  and  government  records  administrators.  Papers  will 
be  presented  in  three  general  areas: 

1.  Coping  with  Big  Brother  (Privacy  and  Confidentiality) 

2.  Pursuing  Technological  Nirvana  (The  Advance  of  Technology) 

3.  The  Information  Empire  (Roles  and  Responsibilities) 

Proposals  for  papers,  along  with  abstracts,  should  be  forwarded  to 
the  Programme  Chairperson. 


A  number  of  half-day  and  full  day  workshops  covering  such  issues  as 
"Planning  a  Data  Library,"  "Complex  Data  Structures,"  and  "Micro- 
Computer  Applications"  will  be  offered  on  Tuesday,  May  15.  Time  has 
also  been  set  aside  for  Association  business  including  committee 
meetings  and  the  Annual  General  meeting. 


The  members  of  the  Local  Arrangements  Committee  are  ensuring  that  there  will 
be  sufficient  time  to  socialize  and  to  enjoy  the  pleasures  that  Ottawa  has  to 
offer.  As  an  example,  the  Conference  has  been  timed  to  provide  participants 
with  the  opportunity  to  enjoy  the  annual  Festival  of  Spring.  In  addition. 
Committee  members  are  planning  a  number  of  receptions,  tours  and  other  social 
activities  designed  to  make  your  stay  in  Ottawa  an  enjoyable  one.  For  infor- 
mation about  local  arrangements  please  contact  John  McDonald,  c/o  Machine 
Readable  Archives  Division,  Public  Archives,  Ottawa,  Ontario,  KIA  0N3, 
(613)  593-7772. 

For  further  information  on  our  planning  activities  please  write 
or  call : 

Harold  Naugler  (Programme  Chairperson) 

c/o  Machine  Readable  Archives  Division 

Public  Archives  of  Canada 

395  Wellington  Street 

Ottawa,  Ontario 


KIA  0N3    CANADA 
(613)  593-777? 


\. 
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DATA  HANDBOOK  SERIES 

of  the  Center  for  Historical  Social  Research 
(ISSN:  0723-8371) 

This  series  publishes  data  available  also  in  machine-readable  form  from  the 
Center. 

NOW  AVAILABLE: 

Volume  1   Frank  R.  Phfetsch:  Datenhandbuch  zur  Wissenschaftsentwicklung  -  Die 
staatliche  Finanzierung  der  Wissenschaft  in  Deutschland  1850-1975. 

This  handbook  contains  time  series  data  on  science  expenditures  by 
the  central  government  and  the  states  of  Germany  1850-1975  sub- 
divided into  categories  such  as  ministries,  institutions,  functions, 
disciplines,  activities,  goals,  and  orientations. 

ISBN  3-923876-00-9,  1983 

5bO  pages  (including  490  pages  of  tables),  paperback, 

DM  60,/$24  (including  surface  postage  and  handling) 

Volume  2   Heinrich  Best:  Die  Abgeordneten  der  Assemblee  Nationale  Constituante 
1848-49  -  Sozialprofil  und  legislatives  Verhalten 

This  handbook  documents  the  biographical  and  career  data  of  the  over 
900  members  of  the  Assemblee  Nationale  Constituante,  Paris  1848-9, 
and  their  voting  behavior  in  124  roll  calls  in  a  codebook  format 
(codes  and  frequency  distributions) 

ISBN  3-923876-01-7,  1983 

132  pages  (including  120  pages  of  tables),  paperback 

DM  24,/$10  (including  surface  postage  and  handling) 


Please  send  me: 


order  form 


cop(ies)  of  PFETSCH,  Wissenschaftsentwicklung 
1850-1975,  1983,  550  pages,  pb.  DM  60,/$24.00. 

cop(ies)  of  BEST,  Assemblee  Nationale  Constituante 
1848/49,  1983,  132  pages,  pb.  DM  25,/$10.00. 


MAIL  TO: 

ZENTRUM  FUR  HISTORISCHE 

SOZ I AL  FORSCHUNG 

UN I VERS  I TXTSSTR .  2  0 

D-5  0  00  kOLN  4  1 

WEST    GERMAIVTY 
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DATA  AND  PROGRAM  LIBRARY  SERVICE  ANNOUNCEMENT: 

OCCUPATIONAL  CHANGES 

IN  A    GENERATION,    1973 

NEW  DATA  FILES  NOW  AVAILABLE 

In  August,  1973  the  U.S.  Bureau  of  the  Census  conducted  a  survey,  Occupational 
Changes  in  a  Generation   (0C6) ,  on  behalf  of  David  L.  Featherman  and  Robert  M. 
Hauser,  to  explore  the  effects  that  peoples'  backgrounds,  schooling,  military 
service,  and  early  work  experiences  have  on  their  careers.  The  target  sample 
was  men  aged  20  to  65,  but  some  of  the  same  data  were  ascertained  for  wives  of 
married,  spouse-present  men.  The  study  was  a  replicate  and  extension  of  the 
1962  Occupational  Changes  in  a  Generation  Survey   conducted  by  the  U.S.  Bureau  of 
the  Census  on  behalf  of  Peter  M.  Blau  and  Otis  Dudley  Duncan  to  examine  the 
extent  and  sources  of  social  mobility  in  the  U.S.  The  1973  study  was  conducted 
as  a  supplement  to  the  1973  March  Current  Population  Survey   (CPS). 

The  DPLS  archive  has  already  released  two  data  files  in  the  OCG  series:  Occupa- 
tional Changes  in  a  Generation^    1962   and  Occupational  Changes  in  a  Generation: 
Replicate  Master  File,    1962  and  197S.      The  OCG  1973   study  contains  all  the  source 
materials  for  the  1973  segment  of  the  Replicate  Master  File,   supplemented  by 
variables  and  codes  that  were  not  replicates  of  those  in  the  1962  OCG  file.  Data 
files  scheduled  for  release  after  January  1,  1984  include: 

Occupational  Changes  in  a  Generation,    1972 
(March  1973  CPS  and  1973  OCG) 

Occupational  Changes  in  a  Generation,    1972: 

Replicate  Sample  Weights 

(March  1973  CPS,  1973  OCG,  and  replicate  sample  weights) 

Much  of  the  documentation  accompanying  the  Occupational  Changes  in  a  Generation: 
Replicate  Master  File   has  been  reproduced  and  is  augmented  by  technical  notes  on 
data  quality.  The  data  file  and  documentation  for  OCG,  1972   can  be  obtained  for 
about  $123  documentation  alone,  $58.   The  OCG,    1972:   Internal  Sample  Weights 
can  be  obtained  for  $254.  The  cost  includes  xeroxing,  staff  time,' magnetic 
tape(s),  computer  time,  postage,  and  a  maintenance/preservation  fee.  The  data 
file  can  be  written  to  user  specifications,  in  BCD,  EBCDIC,  or  ASCII.  For 
further  information,  contact: 

The  Data  and  Program  Library  Service 

3133  Social  Science  Building 

University  of  Wisconsin-Madison 

Madison,  Wisconsin  53706 

(Tel.  No.:  608/262-7962) 
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^\  EUSIDIC 

^"^  DATABASE 

762  BIBLIOGRAPHIC  FILES  G  U  I  DE 

1083  DATABANKS  iQQQ 

983  SUPPLIERS.  VENDORS.  C/OO 

NETWORKERS  AND  BROKERS 

2828  DIRECTORY  ENTRIES  (an  increase  of  500  on  the  1981  edition) 


The  EUSIDIC  DATABASE  GUIDE,   1983  is   the  international   directory  to  ewery 
machine-readable  database  available,  or  potentially  available,  to  the  European 
public.     The  Guide  lists  all   reported  European  operations,  whether  online  or 
in  batch  mode,  and  all  reported  non-European  operations  and  services  networked 
in  Europe. 


PUBLISHED   DECEMBER   1982;    PRICE        25.00 
LEARNED    INFORMATION.    LTD. 
BESSELSLEIGH    ROAD.    ABINGDON. 
OXFORD.    ENGLAND 


UrDaTe 


In  the  last  issue  of  the  Quarterly  (Fall,  1983),  we  reprinted  an  article 
from  DATA  USER  NEWS  on  the  population  of  Mexico.  The  report  titled  DETAILED 
STATISTICS  ON  THE  URBAN  AND  RURAL  POPULATIONS  OF  MEXICO,  1950-2010  has  been 
reduced  in  price  and  may  now  be  obtained  from  the  Center  for  International 
Research,  Bureau  of  the  Census,  Washington,  D.C.,  20233  for  $10.00. 
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lASSIST 


MEMBERSHIP  AND  SUBSCRIPTION  FEES 


The  International  Association  for  Social  Science  Information 
Services  and  Technology  (lASSIST)  is  a  professional  association 
of  individuals  who  are  engaged  in  the  acquisition,  processing, 
maintenance,  and  distribution  of  machine  readable  text  and/or 
numeric  social  science  data.  The  membership  includes  information 
systems  specialists,  data  base  librarians  or  administrators, 
archivists,  researchers,  programmers,  and  managers.  Their  range 
of  interests  encompasses  hardcopy  as  well  as  machine  readable 
data. 


Paid-up  members  enjoy  voting  rights  and  receive  the  lASSIST 
QUARTERLY.  They  also  benefit  from  reduced  fees  for  attendance 
at  regional  and  international  conferences  sponsored  by  lASSIST. 


Membership  fees  are: 


REGULAR  MEMBERSHIP: 
STUDENT  MEMBERSHIP: 


$20  per  calendar  year 
$10  per  calendar  year 


Institional  subscriptions  to  the  QUARTERLY  are  available,  but  do 
not  confer  voting  rights  or  other  membership  benefits. 


INSTITUTIONAL  SUBSCRIPTION; 


$35  per  calendar  year 
(includes  one  volume  of 
the  QUARTERLY) 


Ms.  Jacqueline  McGee 

The  Rand  Corporation 

1700  Main  Street 

Santa  Monica,  California 

U.S.A.    90406 

(213)  393-0411 


lASSIST 


MEMBERSHIP   APPLICATION 


Please  accept  my  application  for  membership  in  lASSIST: 

PHONE  ] ) 


NAME 

INSTITUTION  AFFILIATION 
MAILING  ADDRESS 


ZIP/POSTAL  CX)DE 


I  am  requesting: 


□  REGULAR  MEMBERSHIP  .  S20.00 
D  STUDENT  MEMBERSHIP  ,  $10.00 
n   INSTITUTIONAL  MEMBERSHIP   $35.00 


MAKE  CHECKS  PAYABLE 
TO  I ASS  I  ST 


My  primary  interests  are: 

□  ARCHIVE  SERVICES/ADMINISTRATION 
n  DATA  PROCESSING/ DATA  MANAGEMENT 

□  RESEARCH  APPLICATIONS 

□  OTHER   ( SPEC  I  FY )  


Q   PLEASE  INCLUDE  MY  NAME  IN  THE  lASSIST  MEMBERSHIP  DIRECTORY 


MS.  JACKIE  MCGEE 

THE  TREASURER,   lASSIST 

RAND  CORPORATION 

170  0  MAIN  STREET 

SANTA  MONICA,  CA   9040  6 
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ADMINISTRATIVE  COMMITTEE 
President:   Sue  Gavrel,  Machine  Readable  Data  Archives,  Public  Archives  of  Canada, 
395  Wellington  Street,  Ottawa,  Ontario  KIA  0N4,  Canada 

Vice  President:  Henk  Schrik,  Steinmetz  Archives,  Herengracht,  410-412,  1017  BX 
Amsterdam,  The  Netherlands. 

REGIONAL  SECRETARIES 

Asia:   Naresh  Nijhawan,  Indian  Council  of  Social  Science  REsearch,  Data  Archive 
35  Ferozshah  Road,  New  Delhi,  11001,  India 

Australia:   Terence  Beed,  Sample  Survey  Centre,  University  of  Sydney,  Sydney,  New 
South  Wales,  Australia  2006 

Canada :   Wendy  Watkins ,  Social  Science  Data  Archive,  Carleton  University,  Ottawa, 
Ontario  KIS  5B6,  Canada 

East  Europe:  Bartlomiej  Gasiorowski,  Komitet  Nauk  Polityeznych,  PAN  Danych  skrytka 
pocztowa  12,00-955  Warszawa,  Poland 

West  Europe:   Erika  Von  Brunken,  Medical  Information  Center,  MIC-KIBIC,  Box  60201, 
S-104  01  Stockholm,  Sweden 

United  States:   Judith  S.  Rowe,  Computer  Center,  87  Prospect  Street,  Princeton 
Univeristy,  Princeton,  New  Jersey  08544,  U.S.A. 

MEMBERS-AT-LARGE 

John  de  Vries,  Department  of  Sociology,  Carleton  University,  Ottawa,  Ontario 
KIS  5B6,  Canada 

Sue  A.  Dodd,  Social  Science  Data  Library,  Manning  Hall,  University  of  North 
Carolina,  Chapel  Hills,  North  Carolina  27514,  U.S.A. 

Carolyn  Geda,  Inter-University  Consortium  for  Political  and  Social  Research 
P.O.  Box  1248,  University  of  Michigan,  Ann  Arbor,  Michigan  48106,  U.S.A. 

Jackie  McGee,  The  RAND  Corporation,  1700  Main  Street,  Santa  Monica,  California 
90406,  U.S.A. 

Nancy  Carmichael  McManus,  Social  Science  Research  Council,  1438  Corcoran  Street, 
Nl^,  Washington,  D.C.   20009,  U.S.A. 

Ekkehard  Mochmann,  Zentralarchiv  fur  Empirische  Sozialforschung,  University  of 
Cologne,  Bachemerstrasse  40,  D-5000  Cologne  41,  Federal  Republic  of  Germany 

David  Nasatir,  California  State  University,  Domiguez  Hills,  California  90747, 
U.S.A. 

Per  Nielsen,  Danish  Data  Archives,  Niels  Bohrs  Alle  25,  DK-5230  Odense  M.  Denmark 

Laine  G.  M.  Runs,  Data  Library,  University  of  British  Columbia,  2075  Wesbrook  Hall, 
Vancouver,  British  Columbia,  V6T  1W5,  Canada 

Don  Trees,  VIAR  and  Company,  300  N.  Lee  Street,  Alexandria,  Virginia   22314, 
U.S.A. 

EX  OFFICIO 

Treasurer :  Ms.  Jackie  McGee,  The  F^and  Corporation,  1700  Main  Street,  Santa 
Monica,  California   90406,  U.S.A. 

Editor:  Elizabeth  Stephenson,  Institute  for  Social  Science  Research,  University 
of  California,  405  Hllgard  Avenue,  Los  Angeles,  California  90024,  U.S.A. 


